Algorithms for Computing the Quartet Distance
نویسنده
چکیده
Evolutionary (Phylogenetic) trees are constructs of the biological and medical sciences, their purpose is to establish the relationship between a set of species (phyla). Often it is the case that the true evolutionary tree is unknown and one can only try to estimate it. Reconstruction methods are manifold and the resulting evolutionary trees are not guaranteed to be correct. In order to establish the quality of constructed evolutionary trees one might consider a similarity measure on such trees. The quartet metric is one such possible measure. Quartets are trees containing precisely four species. A quartet has one of four possible topologies. The quartet distance between two trees is the number of quartets, containing the same four species, having different topologies. We have developed a framework for comparing evolutionary trees according to the quartet metric. Three applications of this framework have been analysed in this thesis. We have considered trees of arbitrary degree, trees of constant degree and finally considered calculating the pairwise quartet distance between multiple trees. Each of these applications have led to algorithms superior to existing algorithms with respect to both time and space.
منابع مشابه
Computing the Quartet Distance Between Trees of Arbitrary Degree
We present two algorithms for computing the quartet distance between trees of arbitrary degree. The quartet distance between two unrooted evolutionary trees is the number of quartets—sub-trees induced by four leaves—that differs between the trees. Previous algorithms focus on computing the quartet distance between binary trees. In this paper, we present two algorithms for computing the quartet ...
متن کاملEfficient algorithms for computing the triplet and quartet distance between trees of arbitrary degree
The triplet and quartet distances are distance measures to compare two rooted and two unrooted trees, respectively. The leaves of the two trees should have the same set of n labels. The distances are defined by enumerating all subsets of three labels (triplets) and four labels (quartets), respectively, and counting how often the induced topologies in the two input trees are different. In this p...
متن کاملAlgorithms for Computing the Quartet Distance between Trees of Arbitrary Degree
We present two algorithms for computing the quartet distance between trees of arbitrary degree. The quartet distance between two unrooted evolutionary trees is the number of quartets—sub-trees induced by four leaves—that differs between the trees. Previous algorithms focus on computing the quartet distance between binary trees. In this paper, we present two algorithms for computing the quartet ...
متن کاملComputing the All-Pairs Quartet Distance on a Set of Evolutionary Trees
We present two algorithms for calculating the quartet distance between all pairs of trees in a set of binary evolutionary trees on a common set of species. The algorithms exploit common substructure among the trees to speed up the pairwise distance calculations, thus performing significantly better on large sets of trees compared to performing distinct pairwise distance calculations, as we illu...
متن کاملComputing the Quartet Distance Between Trees of Arbitrary Degrees
Comparing trees with regard to their topology is in itself an interesting theoretical problem in computer science, and furthermore researchers working in the interdisciplinary field of computational biology need tools to compare phylogenetic trees, i.e. trees that describe the relation of species according to evolutionary history. Different methods and different information can result in differ...
متن کامل